AP® STATISTICS 
2006 SCORING GUIDELINES (Form B) 


Question 6 


Intent of Question 

The primary goals of this question are to evaluate a student’s ability to apply the concepts of significance testing 
to a new situation, in particular to: (1) state hypotheses for a parameter of interest given a research question; (2) 
recognize that a large sample test should not be used in this setting; (3) identify the possible values of a new test 
statistic and calculate the probability distribution for this new test statistic, assuming the null hypothesis is true; 
(4) use the probability distribution of the test statistic under the null hypothesis to identify possible significance 
levels; (5) conduct a significance test for a small set of data from an initial study; and (6) make a recommendation 
to improve on the initial study. 

Solution 

Part (a): 


Hp : p = 0.5 versus H, : p # 0.5 
Part (b): 

The conditions for the large sample one-proportion z-test are not satisfied. np = nd - p)=8xX05=4<5. 
Part (c): 


X will follow a binomial distribution with n = 8 and p = 0.5. The possible values of X and their 
corresponding probabilities are given in the table below. 





Probability 
0.00391 
0.03125 
0.10937 
0.21875 
0.27344 
0.21875 
0.10937 
0.03125 
0.00391 





























|AJa]}n) BID fro}—|o] 54 








Part (d): 
No, there is no possible test with a p-value of exactly 0.05. 


The probability that none of the individuals (X = 0) or all of the individuals(X = 8) prefer Citrus Fresh 
is 2 X 0.003906 = 0.0078 , which is less than 0.05. 


The probability that one or fewer of the individuals (X < 1) or seven or more of the individuals (X = 7) 
prefer Citrus Fresh is 2 < (0.003906 + 0.031250) = 0.070312, which is greater than 0.05. 
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Question 6 (continued) 
Part (e): 


For the preference data provided, X = 2. From the table of binomial probabilities computed in part (c), the 
probability that two or fewer of the individuals (X < 2) or six or more of the individuals (X = 6) prefer 
Citrus Fresh when p = 0.5 is2 X (0.003906 + 0.031250 + 0.109375) = 0.289062 . Because the p-value of 
0.289062 is greater than any reasonable significance level, say 0.070312, we would not reject the null 
hypothesis that p = 0.5. That is, we do not have statistically significant evidence for a consumer preference 
between Citrus Fresh and Tropical Taste. 


Part (f): 


Increase the number of consumers involved in the preference test. More consumers will give you more data, 
and you will be better able to detect a difference between the population proportion of consumers who prefer 
Citrus Fresh and 0.5. The sample proportion in the initial study was only 0.25 (2/8), but we were not able to 


reject the null hypothesis that p = > By increasing the number of consumers, a difference of that magnitude 


would allow the null hypothesis to be rejected. For example, with n = 80 and X = 20 the large sample 
z-Statistic would be z = Tass = —4.47 and the p-value would be approximately zero. 


80 


OR 


ae p-0.5 ( p- 32) : boas 
The large sample test statistic is z = ———————_ =_ Vn. | ————_}, which would get larger as n is increased, 
ee (0.5)(0.5) 0.5 ar 


n 
for the same p. Thus, by using more consumers it would eventually be possible to reject Hy if there really is 


a difference and identify the preferred juice. 


Scoring 


Parts (a) and (b) are combined and scored as essentially correct (E), partially correct (P), or incorrect (I). Parts (c) 
and (d) are combined and scored as essentially correct (E), partially correct (P), or incorrect (I). Part (e) contains 
two parts, correct mechanics and conclusion, and is scored essentially correct (E), partially correct (P), or 
incorrect (I). Part (f) is scored as essentially correct (E), partially correct (P), or incorrect (I). 


Parts (a) and (b) are essentially correct (E) if both parts are correct. 
Parts (a) and (b) are partially correct (P) if one of the two parts is correct. 
Notes for parts (a) and (b): 


If a one-sided alternative is used in part (a), then the maximum score for the AB component is partially 
correct (P). 
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Question 6 (continued) 


Other reasonable statements about the conditions for inference not being met are acceptable. Some 
examples are: 

o Since n = 8, the number of successes and the number of failures will both be less than 10. 

o The sample size is too small to use the large-sample inference procedure. 

A response in part (b) about a test for comparing proportions from two independent samples (one for 
Tropical Taste and one for Citrus Fresh) should be scored as incorrect (I). 


Parts (c) and (d) are essentially correct (E) if both parts are correct. 
Parts (c) and (d) are partially correct (P) if one of the two parts is correct. 


Notes for parts (c) and (d): 
Part (c) is correct if binomial probabilities are correctly calculated for each of the nine possible outcomes 
using n = 8 and p = 0.5. 
Part (d) is correct if the response includes all three of the following: 
o The student states that a 0.05 level test is not possible. 
o. The justification includes a correct description of the test with significance level 0.00782. 
o The justification includes a correct description of the test with significance level 0.07032. 


Part (e) is essentially correct (E) if both parts are correct. 
Part (e) is partially correct (P) if one part is correct. 


Notes for part (e): 

The mechanics are essentially correct if the student uses the information from the response to part (c) to 

compute the appropriate p-value. 

o Ifa large sample z-test, or a f-test, is presented, the mechanics are scored as incorrect. 

o Using only 0.10937, the probability that X is 2 from the table in part (c), is incorrect. 

To be essentially correct, a correct conclusion must be stated in the context of the problem with linkage 

between a p-value and a level of significance a . 

co If the student refers to the rejection region identified in part (d), they do not need to restate the 
significance level from part (e). 

o If the student conducts an @ = 0.05 level test by comparing the correct p-value to 0.05, even though 
they just said in part (d) that a 0.05 level test is not possible, the response should be scored as 
essentially correct. 


Part (f) is essentially correct (E) if the provides a reasonable recommendation in the context of comparing 
preferences for the two juices with an appropriate statistical justification. For example, the student describes the 
impact of a larger sample size on the hypothesis testing procedure used in the initial study by saying that 
increasing the sample size will reduce the chance of making a Type II error, increasing the sample size will 
increase the power of the test used to detect a difference in juice preferences in part (d), or increasing the sample 
size will reduce the standard error for the estimated proportion of people who prefer Citrus Fresh. 


Part (f) is partially correct (P) if the student provides a reasonable recommendation in the context of comparing 


preferences for the two juices but does not provides sufficient statistical justification that refers to the initial 
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Question 6 (continued) 


study, e.g., a larger sample is needed to meet the conditions for using a z-test, larger samples are more 

representative of the population, or increasing the sample size makes the sample or population more normal 
OR 

Gives a weak recommendation and provides a good statistical justification, for example, providing the subjects 

with a cracker between juices to reduce possible carryover effects. 


Incorrect (I) if the student fails to provide a reasonable recommendation in the context of comparing preferences 
for the two juices. 


Each essentially correct response is worth 1 point; each partially correct response is worth 2 point. 


4 Complete Response 
3 Substantial Response 
2 Developing Response 
1 Minimal Response 


If a response is between two scores (for example, 2! points), use a holistic approach to determine whether 
to score up or down depending on the strength of the response and communication. 
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6A) 
STATISTICS 
SECTION II 
Part B 
Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section II grade—25 — 


Directions: Show all your work. Indicate clearly the methods you use, because you will be graded on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. Sunshine Farms wants to know whether there is a difference in consumer preference for two new juice products 
—Citrus Fresh and Tropical Taste. In an initial blind taste test, 8 randomly selected consumers were given 
unmarked samples of the two juices. The product that each consumer tasted first was randomly decided by the 
flip of a coin. After tasting the two juices, each consumer was asked to choose which juice he or she preferred, 
and the results were recorded. 


(a) Let p represent the population proportion of consumers who prefer Citrus Fresh. In terms of p, state the 
hypotheses that Sunshine Farms is interested in testing. 


Ho: ptr 
Pa e% 


(b) One might consider using a one-proportion z-test to test the hypotheses in part (a). Explain why this would 
not be a reasonable procedure for this sample. 


A one pr porhen 2-tevt eueeve nok he Agernpri att. he case 
the sample sine toe small and the shandard denahon 


1S anki , 


GO ON TO THE NEXT PAGE. 
-14- 
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(c) Let X represent the number.of consumers in the sample who prefer Citrus Fresh. Assuming there is no 


difference in consumer preference, find the probability for each possible value of X Record the x-values 
and the corresponding probabilities in the table below. 


(eee) 
«0039 9625 
me iks 
ADTs 
2V878 
ZIBH4RIS 
UTS 
LOSsts 
OUNZS 
99390625 














ba] 





eo Se ALWP-O 





(d) When testing the hypotheses in part (a), Sunshine Farms will conclude that there is a consumer preference if 
too many or too few individuals prefer Citrus Fresh. Based on your probabilities in part (c), is it possible for 
the significance level (probability of rejecting the null hypothesis when it is true) for this test to be exactly 
0.05 ? Justify your answer. ~ 

We lee cose o's sc | a discrete candor verialle F Tea order 
for the significance \ewl oe bee exactly 0.08 Caccerding 4 
athe table above), rhe valve af ~% would ave to be 
’ 
Yetween “To and 8 an d thee ae no valres hetween 
é : r . 


ae | and s ‘ 


GO ON TO THE NEXT PAGE. 
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(e) The preference data for the 8 randomly selected consumers are given in the table below. 


|__| Tropical Taste_| 
| —2,__Citrus Fresh 
ay eal Te 
| 5___| Tropical Taste _| 
| 6 | Citrus Fresh _| 
| 8 | Tropical Taste_| 


Based on these preferences and your previous work, test the hypotheses in part (a). 


4, geaele preter Cams fresh, thas X* 2. “The 


L. LA: heb one obseves a value ads least as 
vw a “ 

ret Ae ye ¥ othe! r ‘ “e 
exirews as ah: a assum = t AP His ae , ae al Ss 


e(xer)s Peet) + ele=2) = ANd 



















Recance awe 8 @ frorrailed Hest, the grvalke cs 
CRY 


devel, . thas 2 
o> 2(.suas)= .2899 


A sianih cance 
Mad mere exists 


Gy 


level o£ .2890 is insufGuens +e concluce. 
a ere ference. for either we product. 


GO ON TO THE NEXT PAGE. 
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a oAul 
(f) Sunshine Farms plans to add one of these two new juices—Citrus Fresh or Tropical Taste—to its 
production schedule. A follow-up study will be conducted to decide which of the two juices to produce. 
Make one recommendation for the follow-up study that would make it better than the initial study. Provide a 
statistical justification for your recommendation in the context of the problem. 


One Suage cho would lye ts signAicrnct lay increase he. 
nua bese of Cages sampled. aes w Fe 30, aren tz 


2-xesk kd le used. A larger Sarmele. size would algs 


Lancet Xe phe Gower” of phe dest ; mereasing, phe. Srvbals’ Neg 


Prod Me null hy pathesis wo\\ \se cee ete ule A are. Leise, 
aus: alawng Sunshine Farms An reach Oy conclusion 


as Ao which pares to groduce : 


STOP 


END OF EXAM 





THE FOLLOWING INSTRUCTIONS APPLY TO THE COVERS OF THE 
SECTION Il BOOKLET. 


e MAKE SURE YOU HAVE COMPLETED THE IDENTIFICATION 
INFORMATION AS REQUESTED ON THE FRONT AND BACK 
COVERS OF THE SECTION II BOOKLET. 


e CHECK TO SEE THAT YOUR AP NUMBER LABEL APPEARS IN 
THE BOX(ES) ON THE COVER(S). 


® MAKE SURE YOU HAVE USED THE SAME SET OF AP 
NUMBER LABELS ON ALL AP EXAMS YOU HAVE TAKEN 
THIS YEAR. 
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bE 
STATISTICS ; 
SECTION II 
Part B 
Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section II grade-—25 


Directions: Show all your work. Indicate clearly the methods you use, because you will be graded on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 






6. Sunshine Farms wants to. know.whether there isa a difference) 
—~Citrus Fresh and | Tropical Ta Taste. In an initial blin taste te: 2 elect p} given 
unmarked sainples ofthe 8 two vo juices. The > product that each con: imer tasted “4 first v was as randomly decided by the 


flip of a coin’ After tasting the two juices, each ¢ consumer was asked to choose which juice | he or, she preferred, 
_“and the res@ilts were recorded, 





(a) _Let p represent the population proportion of consumers who prefer Citrus Fresh, In terms of s of p, State the _ 
_hypothesés t that Sunshine Farms i is interested in in testing. 


Ho P=Oh. 


(b) One might consider using a one-proportion z-test to test the hypotheses in part (a). Explain why this would 
not be a reasonable procedure for this sample. 


Bec +h€ SOMNPL WZE is nor Large 
CHOU i} 


NO = (9)(0H) =A <F (xX) 


GO ON TO THE NEXT PAGE. 
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(c) Let X_represent the number of consumers in the sample who prefer Citrus Fresh. Assuming there i is.no 


difference in consumer - preference, find the probability for each possible value of X. Record the x- sO. 
“and the corresponding probabilities in the table below. “— T yoauid USe 


binomial +heoren’ 
COUSE 
> +nere ere 1 AVidls 
weare Two 
; Ten ole ost CON 
3) 7WO ae Ofe. 


ranger cle rt 











_ the Serenays level (probability | ‘of re ‘rejecting t the ‘full hypothesis | when, it is true) f for this test’ ‘to be « exactly 
a: 05? Justify y your: answer. 


‘Wo, ype Sums of the probavilites of 
| mas prefemiN®  parncurar OLA 
does wr (each O04. 


Thove Are ve COnnut be COCKY, 0.05, 


Consu 
either exceeds OF 
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(e) The preference data for the 8 randomly selected consumers are given in the table below. 






Individual | Juice Preference | 


q 








a 


Why 


| Citrus Fresh 
| (73 | Tropical Taste 
: 
[ Tropical Taste | 


Based on these preferences and your previous work, test the hypotheses in part (a). 


The PobaclitTy that 2 Consumers WOLd 
aefer citrus Fresn when there isl OMUINeY 
peferene \S 0, | 


| G31h. 
Twwld USE A=j]OOR , 0109377605 
Because the p-value is greater ran J, 
there is not € Ndugn evidena to reject 
tho null hy puttest'S that share 1S NO 
—cureumer prererenca. 





‘ 


~~] 
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(f) Sunshine Farms plans to add one of these two new juices-—Citrus Fresh or Tropical Taste—to its 6 8 4 
production schedule. A follow-up study will be conducted to decide which of the two juices to produce. ' 

Make one recommendation for the follow-up study that would make it better than the initial study. Provide a 
statistical justification for your recommendation in the context of the problem. 


ge o sample with larcer camp size. 
Larae sample size will “Allon (eKOtaners 
Lo use prmportiol z-+test and reduce. 


we Confidence Wrrecvais 
Because Siaidard cesiccion OF propetan 


is PERT jarger samp siee will ee 
} 
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aut 4% 0? noe ips. 


STOP 


END OF F EXAM 


_ THE FOLLOWING INSTRUCTIONS APPLY TO THE COVERS OF THE 
SECTION tl BOOKLET. 


° MAKE SURE YOU HAVE COMPLETED THE IDENTIFICATION 
INFORMATION AS REQUESTED ON THE FRONT AND BACK 
COVERS OF THE SECTION Il BOOKLET. 


- @ CHECK TO SEE THAT YOUR AP NUMBER LABEL APPEARS IN 
THE BOX(ES) ON THE COVER(S). 


e MAKE SURE YOU HAVE USED THE SAME SET OF AP 
NUMBER LABELS ON ALL AP EXAMS YOU HAVE TAKEN 
THIS YEAR. 


© 2006 The College Board. All rights reserved. 
Visit apcentral.collegeboard.com (for AP professionals) and www.collegeboard.com/apstudents (for students and parents). 


oC] 


STATISTICS 
SECTION If 
Part B 
Question 6 
Spend about 25 minutes on this part of the exam. 
. Percent of Section If grade —25 


Directions: Show all your work. Indicate clearly the methods you use, because you will be graded on the | 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. Sunshine Farms wants to know whether there is a difference in consumer preference for two new juice products 
—cCitms Fresh and Tropical Taste. In an initial blind taste test, 8 randomly selected consumers were given 
‘unmarked samples of the two juices. The product that each consumer tasted first was randomly decided by the 
flip of a coin. After tasting the two juices, each consumer was asked to choose which juice he or she preferred, 
and the results were recorded. ; 


(a) Let p represent the population proportion of consumers who prefer Citrus Fresh. In terms of p, state the 
hypotheses that Sunshine Farms is interested in testing. 


: ? : : t ul F : | 
ae i a Ae reference of 
My (| 4 ae Thee t 9 dfweds iw ty, the two 
| tf, , Rey Su 
Me Mevemce Ae ans 


Products 
Ney jAice pre badd, 


(b) One might consider using a one-proportion z-test to test the hypotheses in part (a). Explain why this would 
not be a reasonable procedure for this sample. 
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(c) Let X represent the number of consumers in the sample who prefer Citrus Fresh. Assuming there is no 
difference in consumer preference, find the probability for each possible value of X. Record the x-values 
and the corresponding probabilities in the table below. 


0 LI445 | 
Oylo 4a 
Ba O2 NTs 


40.00 34 





(d) When testing the hypotheses in part (a), Sunshine Farms will conclude that there is a consumer preference if 
too many or too few individuals prefer Citrus Fresh. Based on your probabilities in part (c), is it possible for 
the significance level (probability of rejecting the null hypothesis when it is true) for this test to be exactly 
0.05 ? Justify your answer. 


yes / bot very walibely 
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(e) The preference data for the 8 randomly selected consumers are given in the table below. 


[Individual | Juice Preference | 
| 
| 6 | Citrus Fresh | 

7 
| 8 | Tropical Taste 


Based on these preferences and your previous work, test the hypotheses in part (a). 
gore. as 
fs 0.0455 
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@cy 
(f) Sunshine Farms plans to add one of these two new juices—Citrus Fresh or Tropical Taste—to its 
production schedule. A follow-up study will be conducted to decide which of the two juices to produce. 
Make one recommendation for the follow-up study that would make it better than the initial study. Provide a 
Statistical justification for. your recommendation in the context of the problem. 
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END OF EXAM 
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® CHECK TO SEE THAT YOUR AP NUMBER LABEL APPEARS IN 
THE BOX(ES) ON THE COVER(S). 


@ MAKE SURE YOU HAVE USED THE SAME SET OF AP 
NUMBER LABELS ON ALL AP EXAMS YOU HAVE TAKEN 
THIS YEAR. ; 
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2006 SCORING COMMENTARY (Form B) 


Question 6 


Sample: 6A 
Score: 4 


This essay recognizes that a sample of eight subjects is too small to use a z-test and accurately applies the small 
sample binomial test suggested in the question. The appropriate null and alternative hypotheses are identified in 
part (a). Part (b) is correct, but it would be strengthened by including a statement that for p = 0.5 and n = 8, 


both np and n( — p) are too small to use a one-sample z-test. Binomial probabilities with p = 0.5 and n = 8 


are accurately computed for the nine possible outcomes in part (c), and part (d) recognizes that a significance 
level of 0.05 cannot be achieved. This essay uses the table of probabilities computed in part (c) to obtain the 
correct p-value and provides an appropriate conclusion in the context of the juice-tasting study. Increasing the 
number of consumers randomly selected for the study is the improvement discussed in part (d); and the resulting 
increase in the power of the test to reject the null hypothesis of equal preference, when it is false, provides the 
requested statistical justification. The additional note that a larger sample size would justify the use of a z-test is 
also a valid point, but it is not as important as the increase in the power of the test (or the decrease in the standard 
error of the estimated proportion) provided by the increased sample size. 


Sample: 6B 
Score: 3 


In parts (a) and (b) this essay correctly identifies the null and alternative hypotheses and shows that the sample size is 
too small to use a z-test. Binomial probabilities are correctly computed in part (c), but in part (d) it is not clear if the 
student is thinking about the sums of tail probabilities that correspond to possible type I error levels of 0.0079 and 
0.0703. Realizing that the sample size is too small to apply a z-test from the previous parts of the problem, this essay 
refers to the table of binomial probabilities in part (c) but fails to present the appropriate p-value. The essay only 
displays the probability that exactly two consumers would prefer Citrus Fresh instead of the sum of the appropriate 
tail probabilities. An appropriate conclusion is reached, however, in the context of the problem. Part (d) provides 
statistical justification for increasing the sample size by addressing the issue of reducing the standard error of the 
estimated proportion and decreasing confidence interval width. It also addresses the issue raised in part (b) of 
obtaining a large enough sample size to use a large sample z-test. Since p = 0.5 for the null hypothesis, a sample 


size of n = 20 is needed to satisfy the common rule that both np and n(1 — p) are larger than 10. 


Sample: 6C 
Score: 2 


This essay correctly identifies the appropriate null and alternative hypotheses in part (a) but inappropriately 
suggests a two-sample z-test in part (b). Binomial probabilities are correctly computed in part (c), but an incorrect 
answer with no explanation is given in part (d). Although no work is shown, the value of an inappropriate two- 
sample z-test is presented in part (e) along with the p-value for that test. The conclusion is incorrect for the given 
data, but it is consistent with the p-value for the inappropriate two-sample z-test, and it is stated in the context of the 
juice-tasting study. Although it could be communicated better, the response to part (f) addresses the concept that 
increasing the sample size will decrease the standard deviation of the estimated proportion and lead to more precise 
inference about the proportion of the population that prefers Citrus Fresh. 
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